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Abstract — Accurate device free localization (DFL) based on 
received signal strength (RSS) measurements requires placement 
of radio transceivers on all sides of the target area. Accuracy 
degrades dramatically if sensors do not surround the area. 
However, law enforcement officers sometimes face situations 
where it is not possible or practical to place sensors on all sides of 
the target room or building. For example, for an armed subject 
barricaded in a motel room, police may be able to place sensors in 
adjacent rooms, but not in front of the room, where the subject 
would see them. In this paper, we show that using two ultra- 
wideband (UWB) impulse radios, in addition to multiple RSS 
sensors, improves the localization accuracy, particularly on the 
axis where no sensors are placed (which we call the a;-axis). 
We introduce three methods for combining the RSS and UWB 
data. By using UWB radios together with RSS sensors, it is still 
possible to localize a person through walls even when the devices 
are placed only on two sides of the target area. Including the 
data from the UWB radios can reduce the localization area of 
uncertainty by more than 60%. 

Index Terms — Device-free localization, ultra-wideband, radio 
tomographic imaging, hidden Markov model, bistatic radar 

I. Introduction 

Device free localization (DFL) systems can be used in tacti- 
cal operations or crisis situations to help emergency personnel 
know where people are in a room or building before they 
enter [1|. These systems do not require people to participate 
in the localization effort by wearing or carrying sensors 
or radio devices. Radio frequency (RF) measurements are 
appropriate for emergency operations because RF penetrates 
(non-metal) walls. Such through- wall sensing could benefit 
police during hostage or barricade situations. However, in 
many such situations, it is not possible to place sensors on 
all sides of the building or area. For example, some sides 
of a building might have windows where an armed subject 
may be watching, and deploying sensors on that side could 
expose police to harm or escalating the situation. As another 
example, a room on an upper floor of a building may have 
some accessible interior walls (e.g., in a hallway), but the 
exterior wall may be unaccessible simply because of its height. 
This paper provides a system that expands the possibilities for 
RF-based DFL systems where an area cannot be surrounded 
with sensors by combining RSS-based DFL methods with 
bistatic ultra-wideband (UWB) impulse radar methods. 

We are particularly motivated by discussions with our local 
SWAT team, who have unfortunately, faced three situations in 
as many years in our metro area [2|, |3|, [4| in which hostages 



were taken by a barricaded subject in a hotel or motel room. 
Knowing the location of the suspect represents very valuable 
information in planning the actions (e.g., forced entry) required 
to bring the standoff to an end safely. In such situations, 
sensors could be placed in adjacent rooms to the barricaded 
room, but rooms have front windows, and sometimes back 
windows, thus front and back walls are potentially off-limits. 

A DFL system based on received signal strength (RSS) 
measurements [5|, |6|, |7|, |8| typically has radio transceivers, 
which we call RSS sensors placed on all four sides of a target 
area. Measured RSS between every pair of sensors are used to 
estimate the location of the person in the room in real-time. 
The localization process is based on models for the change 
in RSS introduced by the presence of a person on or near 
the link line, i.e., the straight imaginary line connecting the 
transmitter and receiver |6), (9), 1101 . When RSS sensors are 
placed only on two opposite sides of a room, the links cross 
the monitored area along one axis but not the other. This 
significantly degrades the localization accuracy of the system, 
especially along the axis with no crossing links [6|. 

UWB radios can be used for DFL through walls and 
can be accurate on the order of centimeters or tens of cm 
ifTTI . Ifl2l . Multiple UWB radios cooperating in a multistatic 
radar configuration can provide an unambiguous localization 
estimate iTTTI . A transmitter broadcasts a UWB impulse and 
receivers capture the time-domain channel impulse response 
(CIR) of the environment. Changes to the CIR are detected 
and the time delay beyond the line-of-sight (LoS) pulse for 
each of these changes is used to estimate the range of the 
target from the radios [13|. These radios, however, can be 
prohibitively expensive to install on a permanent basis: a single 
UWB impulse radio can cost thousands of dollars, and using 
only a single pair of radios provides insufficient information 
to unambiguously localize a target. 

In this paper, we introduce a joint DFL system that uses 
the changes measured in RSS and CIR to localize and track 
a target, such as a person, through walls. We demonstrate, in 
particular, the localization accuracy of a system which deploys 
sensors only on two opposite sides of a room. We call the axis 
parallel to the sides of the room without sensors the X axis and 
the axis parallel to the sides of the room with sensors the Y 
axis (see Figures [T] and [2]). The RSS sensors primarily provide 
the information about the target's y coordinate, while the 
UWB radios primarily provide information about the target's 
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x coordinate. This removes the need to have RSS sensors on 
all four sides of a target room and reduces the number of 
UWB radios required for localization. We introduce methods 
to process and combine the RSS and CIR data in order to 
provide a unique position estimate. We also propose methods 
that do not require an initial calibration of the system. The 
experimental results collected in two deployments, i.e., a study 
room at the University of Utah and a motel room in Salt 
Lake City, show that the joint RSS-UWB DFL system can 
accurately localize a non-cooperative target through walls. 
Even when the number of deployed devices is low, e.g, only 
two UWB radios and six (three per side) RSS sensors, the 
system can still provide a position estimate accurate enough 
to reliably indicate in which part of the room the person is 
located. In tactical situations where the only opportunity to 
have access to the target room is to open a breach in a wall 
with an explosive frame, this information can be used by police 
forces to decide which wall has to be detonated and avoid 
hurting or killing the suspect. 

In tactical operations or crisis situations, law enforcement 
may not have the possibility of calibrating the systems used for 
DFL in stationary conditions, i.e., when no person is located in 
the target area. The methods used to process the data coming 
from the RSS sensors and UWB radios should be able to 
localize and track the suspect in the room from the start, 
making DFL a plug-and-play type of system. In this paper, 
we propose novel variance-based methods for RSS and CIR 
measurements that can localize the person without requiring 
an initial calibration of the system in stationary conditions. 

At the time of writing, there are several commercially 
available through-wall radio technologies that can help law 
enforcement determine the position of people inside a room. 
The Prism200 from Cambridge Consultants fBl is a through- 
wall radar system for determining the location and movement 
of people for law enforcement or emergency personnel. The 
Xaver™ products from Camera are also through-wall UWB 
solutions that provide similar capabilities [15|. Time Domain 
is another company that offers solutions for target localization 
and tracking using UWB radios ifTBI . The UWB radios used in 
this work are a pair of P220 UWB radios from Time Domain. 
Compared to these products, the joint RSS-UWB DFL system 
described in this paper is considerably less expensive, as the 
RSS sensors cost few tens of dollars each and only two 
UWB radios are required. Moreover, the compact size and 
low weight of the RSS sensors and UWB radios make our 
system easier to be installed. 

The paper is organized as follows. In Section[1TJ we describe 
the radio tomographic imaging (RTI) technique used to process 
the RSS measurements coming from the RSS sensors. In 



different methods. Conclusions are given in Section VIII 



Section III we describe the methods used for estimating the 



bistatic range of a target using UWB radios by modeling 
the changes to the CIR as a hidden Markov model. Section 
HVI introduces three methods to combine the RSS and CIR 
data in order to provide a unique position estimate. Section 



II. Radio Tomographic Imaging (RTI) 

In RTI, originally introduced in [6], radio transceivers 
placed at known positions form a wireless mesh network and 
collect RSS measurements that can be used to localize and 
track a person in real-time without requiring her to wear or 
carry any sensor or radio device. RTI can provide sub-meter 
localization accuracy, also in through-wall scenarios HI, ifTTl . 
IT51 . The RSS measurements of all the links of the network 
are processed in order to estimate a discretized image x of 
the change in the propagation field of the monitored area 
caused by the presence a person. The estimation problem can 
be defined as: 

y = Wx + n, (1) 

in which y and n are L x 1 vectors of the RSS measurements 
and noise of the L links of the network, respectively, and x 
is the TV x 1 image to be estimated, where N is the number 
of voxels of the image. Each element x n of x represents the 
change in the propagation field due to the presence of a person 
in voxel n. The L x N weight matrix W represents a spatial 
impact model between the L links of the network and the 
N voxels of the image. The model used in RTI flj, (6), |9), 
IfTTl is an ellipse having the foci located at the transmitter 
and receiver of the the link. The voxels located within the 
ellipse have their weight set to a constant which is inversely 
proportional to the root distance between the transmitter and 
receiver, while the voxels located outside of the ellipse have 
their weight set to zero. 

A. Attenuation-based RTI 

For attenuation-based RTI, we use the method introduced 
in IfTTl . In this section, we briefly present this method and the 
terminology that will be used also in the following sections. 

The RSS of link I on channel c at time instant k, n c (fc), 
can be modeled as: 

ri, c (k) = P c -Li jC -Si lC (k) + F ltC (k)- Vl}C (k), ceJ(2) 

where P c is the transmit power of the sensors, Li, c the 
large scale path loss, Si }C the shadowing loss, Fi, c the fading 
gain (or fade level ifTOll ). r\i^ c the measurement noise, and 
T = {1, . . . , H} is the set of measured frequency channels. 
Both the large scale path loss L\ c and the shadowing loss 
Si, c change very slowly with the center frequency. In our 
experiments, we use IEEE 802.15.4-compliant transceivers 
lfT9ll which may transmit in one of 16 channels across the 
2.4 GHz ISM band. Because the band, 80 MHz, is small 
compared to 2.4 GHz, we can assume that both L; c and Si c 
are independent of the frequency channel c. Consequently, F/ c 
can be calculated as: 



Fi, c (k) = ri, c {k)-P c + ri ltC (k). 



(3) 



VI describes the experiments carried out, while Section VII 



presents the results and compare the performance of the 



Due to the measurement noise ry;. c , the fade level can not be 
measured directly. Thus, we estimate it by using the average 
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RSS, ri c , measured during an initial calibration of the system 
performed when no person is in the monitored area: 

Ft c = f/ )C - minf; c . (4) 

c 

In (10 1, the links are divided in anti-fade and deep fade 
links depending on the change in RSS measured when a person 
crosses the link line, i.e. the imaginary straight line connecting 
the transmitter and receiver. A link is in a deeper fade on 
channel c\ than on channel C2 if f/, Cl < n jC2 . By definition, 
Ft c > and Fi c = for one channel c on each link. Anti- 
fade links are the most informative for localization, since their 
spatial impact area is limited around the link line, while deep 
fade links measure a consistent change in RSS even when the 
person is far from the link line. For this reason, for each link 
I we calculate the fade level in Q of each channel c e T , 
and we rank the measured frequency channels from the most 
anti-fade to the most deep fade. If Ai is the set of size m 
containing the indices of the m top channels in the fade-level 
ranking, the link RSS measurement yi at time fc is calculated 
as: 

V^ k ) = ~ E A? v(fc), (5) 

ceAi 

where Ar; )C (fc) = r^ c (k) — f^ c , i.e., Ari yC (k) is the difference 
between the current RSS measurement and the average RSS 
measured during the initial calibration phase. 

B. Variance-based RTI 

We present a new multi-channel version of variance-based 
RTI extending and improving the results of (TJ. In this new 
method, we also include the concept of fade level. We calculate 
the mean of the RSS over a longer time window in order to 
better estimate the fade level of each frequency channel, and 
we use this mean to calculate the RSS short-term variance. 

The attenuation-based RTI method in ifTTl requires an initial 
calibration of the system in stationary conditions, i.e., when 
the monitored area is empty. Moreover, if the environment 
changes, e.g., when the suspect in the room moves furniture or 
other objects, the RTI system would need to be recalibrated or 
would otherwise lose accuracy. The works in 1181 . [?] address 
this issue and introduce methods capable of estimating the 
baseline RSS of the links on-line. 

In tactical operations, such as when an armed person has 
barricaded himself in a house or motel room before the arrival 
of police forces on the scene, we cannot expect to require 
an empty area. Variance-based RTI can be applied in this 
scenario. The change in RSS due to the presence of a person on 
the link line can be quantified as the unbiased sample variance 
of the last N s RSS measurements: 

1 

si, c (k) = _ E ( r iA k -P) ~ W.cM) , (6) 

where: 



TABLE I 

RTI IMAGE ESTIMATION PARAMETERS 



Description 


Parameter 


Value 


Voxel width [m] 


P 


0.15 


Ellipse excess path length [m] 


A 


0.02 


Voxels' variance [dB] 


°Z 


0.05 


Noise standard deviation [dB] 




1 


Voxels' correlation distance 


6c 


4 


Number of selected channels 


rn 


3 


Short-term RSS variance window 


N s 


5 


Long-term RSS mean window 


N» 


50 


Empty area intensity threshold 


T e 


0.05 


Number of updates for confirmation 




8 


Confirmation window 


H 


15 


Gating area radius [m] 


U! 


1.2 



is the mean of the last RSS measurements of link I on 
channel c, where > N s so to estimate the mean RSS of 
each channel on a longer time window and filter the changes 
due to the person crossing the link line. Variance-based RTI 
does not require an initial calibration of the system and can 
adapt at run-time to eventual changes in the environment. For 
each link I, fii lC (k) in ^ provides an estimate of the fade 
level of channel c at time k. As for attenuation-based RTI in 
Section |II-A| the channels are ranked from the most anti-fade 
to the most deep fade. The link measurement yi at time k is 
calculated as: 

yi(k) = — V hAk). (8) 

m L — ' 

ceAi 

C. RTI image estimation 

Since the number of links L is considerably smaller than 
the number of voxels N, the estimation of the image x 
is an ill-posed inverse problem that can be solved through 
regularization. In this work, we use a regularized least-squares 
approach |9|, |20|, IfTTl . fl8l . The discretized image of the 
change in the propagation field of the monitored area is 
calculated as: 

x = n y , (9) 

where y = [y 1: . . . , y L ] T , and 

n = (w T w + c- 1 _1 w T , (io) 

in which is the regularization parameter. The a priori 
covariance matrix C x is calculated by using an exponential 
spatial decay: 

[CJii = a 2 x e-^~ v ^, (11) 

where <j\ is the variance of voxel measurements, and S c is 
the voxels' correlation distance. The linear transformation II 
is computed only once before the system starts operating in 
real-time. The calculation of x in (|9| requires LxN operations 
and can be performed in real-time. Table [j] indicates the values 
of the parameters of the RTI image estimation process. 
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III. Ultra-wide Band Range Estimation 

Assuming an UWB transmitter sends a pulse S(t), each 
channel impulse response (CIR) is measured as: 



h{t)=^a l 5{t-T l ), 



(12) 



where and r$ are the complex amplitude and time delay 
of the ith multipath component, respectively. The line of sight 
path delay is tq. The path delay of the target, which we wish 
to estimate, is t* . We will consider a discrete-sampled version 
of the signal energy, r k : 



rfc 



\h(t)\ z dt, 



(13) 



'(fc-l/2)T 

where T is the sampling period and k ranges from 1 . . . M. 
In this work, T = 1 ns. From now on, CIR time delays will 
be considered only over discrete time intervals k rather than 
continuously on t, 

A. Changes to the CIR as a Hidden Markov Model 

The changes to the UWB CIR are modeled as a hidden 
Markov chain. We will refer to this method as HMM-UWB or 
hidden Markov model (HMM) based UWB. A hidden Markov 
chain is one whose states, X k = i, are not directly observable 
but are inferred from other observation signals, Ok, available 
from the system. The distribution of the observation signals is 
dependent on the state of the system, i.e., fo,i = P{0\X = i). 
To estimate the probability the system is in a given state 
at any time k, i.e., P(X k — z|0, A), we need to know 
the distributions of the observation signals, the initial state 
probabilities 7Tj, and the state transition probabilities, Pij, all 
of which are described by A = [iri, Pij, fo,i] EH- 

The observations, Ok, are the difference between the CIRs 
of the static environment and the CIRs of when a person is 
located in the monitored area. This difference is calculated 
as the symmetric Kullback-Leibler divergence, also known as 
relative entropy 11221 . Assuming a Gaussian distribution for 
over many CIRs, the closed form solution for the observed 
signal, Ok, is: 



O k 




1 (14) 



where \l v and are the mean and variance of r k during the 
calibration period, and fi q and a 2 q are the mean and variance of 
Tk when a person is located in the target area, respectively. The 
distribution of the observations is approximately a log-normal 
distribution [?]. 

If the changes to the CIR are modeled as a hidden Markov 
chain, the CIR goes from an unchanged state, X = 0, to a 
changed state, X = 1, at the time delay corresponding to 
the time traveled by the UWB pulse from the transmitter to 
reflect off of the target and then arrive at the receiving radio, 
i.e. fe*, which is equivalent to r*. By applying this model 
to the system, standard HMM solving algorithms, such as 
the forward-backward algorithm |2D . can be used to estimate 



when the system state changes and, thus, when changes to the 
CIR occur. The forward-backward algorithm determines the 
most likely state of the system at any given time as: 



X k = arg maxP[X k = i\0, A). 

i 

These state estimates are used to estimate k* as 



(15) 



P(X k = 1|0,A) < 0.5}fc* = {h\X k ^ IV* < fc*} (16) 

The work in [23 1 describes in further detail the method for 
estimating UWB bistatic range and its improved performance 
over other methods. From now on, we will let a k = P(Xk = 
l|O.A). a.k describes the probability those CIRs possibly 
affected by a person at time k are in the affected state. These 
probabilities are used to form the UWB localization image. 

When solving the forward-backward algorithm, accurate 
estimates of when state changes occur is dependent on how 
well A models the true system parameters. Often A, and 
particularly the distribution of observation signals, is unknown 
when beginning to solve the forward-backward algorithm. A 
known A from another environment can be used as an initial 
estimate for A when solving for the state estimates. With more 
observations and state estimates as time progresses, the Baum- 
Welch algorithm can help tune A to more closely match the 
true parameters and improve range estimates |2~T1 . (23). 

It is critical that the observation vectors are accurate repre- 
sentations of the difference between a CIR with a possible 
target and the CIR of the known environment. This is a 
function of how well the calibration CIRs represent the static 
environment. Even as a person is walking in a room, something 
in the environment may change that can significantly affect the 
accuracy of the range estimates due to a change in the static 
environment. This includes changing the position of furniture 
or a door opening or closing. If the bistatic range of the change 
to the environment is closer than the target, it will not be 
possible to estimate the range of the target using the existing 
calibration CIRs. Such a change to the environment requires 
that the calibration CIRs be updated to reflect this change, 
which is very difficult with a person moving within the target 
environment. This is a primary disadvantage of this hidden 
Markov model for detecting changes to the CIR. 

The most extreme case of choosing calibration CIRs would 
be to use the CIR samples that immediately precede the 
samples with a possible target. These calibration CIRs may 
include the target of interest as part of what is considered 
the static environment. This would minimize environmental 
variation that may occur over time and effectively eliminate 
the calibration requirement. This would also have a negative 
effect of biasing the target's range estimate toward the radios 
if the target is moving away from the radios and makes the 
detection of a non-moving target more difficult. However, if 
CIRs are sampled frequently enough relative to the person's 
speed, this bias toward the radios would be minimal. 

In this work, we assume there are no major changes to 
the environment throughout each trial that would require new 
calibration CIRs to be captured. This allows us to use just one 
calibration period for estimating fc*. 
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TABLE II 
UWB ESTIMATION PARAMETERS 



A. Image Combination 



Description 


Parameter 


Value 


Voxel width [m] 


P 


0.15 


Sampling Period [ns] 


T 


1 


Short-term CIR variance window 


N v 


5 


Variance normalization parameter 


P 


VN V 



B. Variance-based UWB Range Estimation 

An alternative method is to use the short-term variance of 
the CIR for each r k . We refer to this method as VB-UWB, or 
variance-based UWB. a k is calculated as: 

a k = ^, (17) 

9r k 

where the variance erf is the unbiased sample variance of 
r-fc over the Njj most recent CIRs. In this work, we let 
Nu = 5, corresponding to the number of CIRs captured 
in approximately 0.5 s. The normalization coefficient g is 
calculated as: 

g = g(l-0) + r k p. (18) 

This is equivalent to applying a low-pass IIR filter to r k . In 
this work, /3 = ~. Because the variance of r k is high when the 
mean of r k is high and vice versa, we normalize the variance 
erf by the mean of r k . In this way, a k increases only when 
the person moves. This method is used in conjunction with 
the variance -based RTI method described in Section Hi- Al The 
primary advantage of this method is that no calibration is 
required to solve for a k . A disadvantage is that the target 
can disappear if it remains motionless over a long period of 
time. We alleviate this problem by applying the methods in 
Section [VJ 

C. UWB Image Estimation 

When estimating the UWB image, the image space is 
constrained to contain only the inner dimensions of the target 
room plus one additional voxel on each image edge. Discretiz- 
ing the image space into N voxels, the image vector is: 



1" 



(19) 



where each voxel i™ has a bistatic range to the UWB trans- 
mitter and receiver described by its path delay k n . The value 
of each voxel, 1^, is calculated as the non-negative difference 
function: 

11 = (a kn - a fen _i) + (20) 
where the non-negative difference function is defined as: 



(x) 







if x > 
if x < 



(21) 



and assuming a n = 0. 

IV. Combining RTI and UWB Information 

In this section we introduce three methods to combine RTI 
and UWB and we compare the results of the different methods 
in Section IVIII 



An RTI image is formed as described in Section II-A after 
every RSS sensor has transmitted a packet on all channels in 
T , i.e., after RSS measurements have been collected on all the 
links and channels. A UWB image is formed for every new 
CIR captured. The two images are combined to form the new 
image L c by performing a voxel-wise product, 



r a r. 



(22) 



where V = x from Equation ^ and l u is from the UWB 
image 1". We define — maxL c . When no person is 

located in the monitored area, Ml,c has a very low value. 
We use a threshold T e to avoid further processing images not 
showing the presence of a person in the target area: if M^c < 
T e , we discard the current combined image and wait for the 
next one formed by the system. Otherwise, we normalize the 
values of the voxels of V and l u such that their minimum 
value is zero and the sum of all voxels is one: 



and similarly for l l 



V 



Ef =1 v 



J a 

6 I 



(23) 



(24) 



The normalization brings the two images in the same range of 
values and weights them equally. The normalized combined 
image L c is calculated again by performing a voxel-wise 
product of V and l u : 



l c = r a r. 



(25) 



The RSS and UWB data collected by the two systems are time 
stamped to allow synchronizing the two images. Images are 
formed at the same rate as the higher of the two sampling rates. 
In our case, since the UWB CIRs are sampled more frequently 
than each RTI cycle, a combined image is formed for each new 
UWB sample. This image will then be the combination of the 
most recently formed RTI with the new UWB image. From 
the normalized combined image, L c , the position of the person 
is estimated as: 



V 



are; maxiv c , 



(26) 



i.e., the person's position estimate is at the voxel n having the 
highest value. 

B. Linear Inversion with UWB Data 

An alternative method to form a combined localization 
image is to modify the weight W matrix in ([T]i to include 
the UWB measurements in the inversion process. We define 
the matrix Wp is an M x N matrix where M is the maximum 
value of k and is the number of voxels of the image. Each 
column of W^r represents the ideal vector of a k if the target 
were located at voxel p. The vector yu is the estimated vector 
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of afc from the results of the forward-backward algorithm. 
Equation ([T} then becomes: 



(27) 



where the subscripts R and U correspond to the matrices de- 
rived from the RSS or UWB data, respectively. The inversion 



















x + 





matrix is calculated as described in Section II-C using the 
combined W matrix. A combined localization image L c is 
then formed by multiplying the inversion matrix in ( 
the combined RSS and UWB measurement matrix y 
position of the person is estimated as in d26t, 



TO) to 

; . The 



C. Estimating X by using Y 

Another method for combining the UWB and RTI images 
is to derive one coordinate of the position estimate of a target 
from each image. First, we estimate the target location from 



the RTI image formed as described in Section II-A The y- 
coordinate from this position estimate is used to derive an 
jc-coordinate from the UWB image, which is calculated as 



described in III-C If the target location estimate from the 
RTI image is at coordinates (x R ,y R ), we consider the row 
of the UWB image corresponding to y R . The target position 
estimate p is set at the voxel having the maximum value in 
that row, i.e., p 



(x u ,y R ). 



V. Localization and Tracking 

The position estimate p is used for updating an already 
existing track of a person or for initiating a new one if the 
target area is empty. To this purpose, we use track confirmation 
and deletion rules l24l . If at time k the set of candidate tracks, 
Td, and the set of confirmed tracks Tf are both empty, i.e., the 
position estimate p(k) is used to start a new candidate track, 
which is added to Td- A candidate track becomes a confirmed 
track only if its position has been updated at least h app times 
in the last H formed combined images (h app < H). If this 
condition is not fulfilled, the candidate track is deleted. 

A gating area of radius u> is centered at the target's position 
estimate p. The radius cj is defined as an integer multiple of 
the voxel width p. If one or both of the sets Tf and T c are 
not empty, only the tracks located within the gating area are 
considered for a potential association to the current position 
estimate. These tracks form the set T g C (7/ U T c )- The 
confirmed tracks in T g are given priority over the candidate 
tracks: the current position estimate is used to update the 
closest confirmed track. Otherwise, if no confirmed track 
exists, the current position estimate is used to update the 
closest candidate track. If the set T g is empty, the current 
position estimate is used to start a new candidate track. By 
using the gating area, we avoid the position estimate of the 
person to have large sudden changes in correspondence of 
noisy RSS and CIR measurements from the two systems. 

VI. Experiments 

The first experiment was conducted in a study room on 
the second floor of the Warnock Engineering Building at the 




Fig. 1. Layout of the study room located in the Warnock Engineering 
Building at the University of Utah used for the experiments. Xs represent 
the 33 RSS sensors. Stars represent the 2 UWB radios. Circles represent the 
steps taken by the person at one second intervals. Grey rectangles represent 
furniture. The target room's inner dimensions are 3.82 m by 5.49 m (21 m 2 
area). 



University of Utah. A total of 33 RSS sensors were placed 
outside of the room along two opposite walls, 17 on one side 
and 16 on the other. The sensors were 30.5 cm apart. Two 
UWB radios were placed on one of the two sides of the room 
where the RSS sensors were positioned. The UWB radios were 
1 m apart. A person walked along a predefined path six times, 
three times counter-clockwise and three times clockwise. The 
person entered and exited the room in each of the six trials. 
With the help of a metronome and markings on the floor, the 
person walked at a constant speed of 0.5 m/s. Figure [T] shows 
the setup of the tests carried out in the study room. 

The second experiment was conducted in a room of a motel 
in Salt Lake City, Utah. The layout of this room is described in 
Figure [2] This time, ten RSS sensors were placed along each 
of the walls separating the room from the adjacent ones. Two 
UWB radios were placed outside one wall of the target room. 
The experiments were conducted with the UWB radios at two 
different distances, 0.9 m and 2.7 m apart. A person walked 
along a predefined path at a constant speed of 0.5 m/s, entering 
and exiting the room each trial. There were no other rooms 
adjacent to the target room besides the two where sensors were 
placed. For the second experiment, a person walked the target 
path 18 times. Six of the trials were done with the UWB radios 
in configuration A and twelve in configuration B, represented 
by white stars and black stars, respectively, in Figure [2] A 
photo of the interior of the target room can be seen in Figure 

El 

VII. Results 

For the first experiment, 50 simulations were run using 
randomly selected subsets of S RSS sensors available on each 
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Fig. 2. Layout of the room of a motel located in Salt Lake City, Utah. Xs 
represent the RSS sensors. White and black stars represent the UWB radios 
in configurations A and B, respectively. Circles represent the steps taken by 
the person at one second intervals. Grey rectangles represent furniture. The 
target room's inner dimensions are 3.96 m by 7.11 m (28 m 2 area). 




Fig. 3. Photo of the interior of the motel room where the second experiment 
was conducted. 



side of the room. The density of sensors on each side of the 
target room is higher than what would be used in a typical 
deployment. Subset sizes for these simulations ranged from 
3 to 10 sensors per side. The same number of sensors were 
used on each side of the room. The same subset of sensors 
was used for each of the six trials and remained the same 
when UWB radio data was included for a given simulation. 
The gating algorithm described in Section [V] was applied in all 
simulations. Simulations were performed using AB-RTI, AB- 
RTI with HMM-UWB, VB-RTI, and VB-RTI with VB-UWB. 

Figure [4] shows the mean RMS localization error for each 
of the methods used measured. Each point on the figure is the 
error is averaged over the 50 simulations and 6 trials performed 
for this experiment using S sensors. 

The Y-axis error improves significantly with each additional 



sensor used on each side of the room. There is also little 
improvement in the Y-axis error as a result of including the 
UWB information. This is the expected behavior of the system. 
Variance-based methods show improvement in reducing Y er- 
ror over attenuation-based methods. The X-axis error improves 
as a result of including more sensors on each side of the room 
but not as greatly as does the Y-axis error. The improvement 
as a result of including UWB information, however, is much 
more significant and is also almost constant as a function of 
the number of RSS sensors. The localization error improves 
overall, that is, as a Euclidean distance, by 51 cm and 33 
cm, on average, for attenuation and variance-based methods, 
respectively. 

As a point of comparison, if a point in the room is selected at 
random at each time, the RMS L2 error is 2.94 m on average 
over the 6 trials. Errors for the X and Y axes by selecting 
random locations are 1.65 m and 2.44 m, respectively. The 
gating algorithm is not applied when performing estimating 
location using random coordinates. 

For the second experiment, 50 simulations were also run 
using randomly selected subsets of S RSS sensors on each 
side of the room for each simulation. When S = 10, however, 
only one simulation was performed because there was only 
one possible combination of S = 10 radios per side. For each 
simulation, localization is performed using AB-RTI, AB-RTI 
with HMM-UWB, VB-RTI, and VB-RTI with VB-UWB. The 
gating algorithm described in Section[V]is also applied to each 
of these methods. Figures [5] shows, from left to right, the L2, 
X, and Y errors when applying these four methods to the data 
collected over the 18 trials performed in the motel room. 

One noticeable difference between the results of the two 
experiments is that the Y error in the second experiment 
degrades significantly by including VB-UWB with VB-RTI 
whereas for the first experiment the Y error was effectively the 
same. Generally, however, the same trends are visible in the 
results for the second experiment. The Y error improves with 
increasing S and including UWB data significantly improves 
X error. 

For the second experiment, the error using 10 sensors per 
side is higher than the error using 7 sensors, in many cases. 
There were only 10 sensors on each side of the room and, 
therefore, only one unique simulation could be performed 
using 10 sensors. By performing many simulations using 
subsets of the available sensors, the effects due to sensor 
placement on localization error could be minimized. This 
was not possible in the case where S = 10 for the second 
experiment. 

Table [Hi] shows the mean RMS error over the 18 trials 
performed for this experiment using all 20 RSS sensors. 
For comparison and as an estimate of the upper bound on 
error for a given environment and target path, random image 
coordinates are selected as the target location estimate. At 
each time when a combined image would be formed, X and 
Y coordinates and randomly selected and are used as the 
location estimate at that time. The gating algorithm described 
in Section [V| is not applied when randomly choosing location 



8 




3456789 10 
S sensors per side 



1.6 t 





AB- 


-RTI 


- ▼- 


AB- 


-RTI & HMM-UWB 




VB- 


-RTI 


- 


VB- 


-RTI & VB-UWB 




3456789 10 
S sensors per side 



3456789 10 
S sensors per side 



Fig. 4. From left to right, the mean RMS L2, X, and Y errors over the 6 trials and 50 simulations using random subsets of S sensors per side of the study 
room. 
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Fig. 5. From left to right, the mean RMS L2, X, and Y errors over the 18 trials and 50 simulations using random subsets of S sensors per side of the motel 
room. When S = 10, only 1 simulation was performed. 



estimates. The results from applying the methods described in 
Sections IV-B andJWjC are also given in Table III 



Note in Table |IIl| that when performing localization using 
AB-RTI or VB-RTI, the X-axis error is about the same as 
that obtained from randomly guessing an X coordinate for 
each image. This is critically important for tactical operations. 
Having some knowledge about the person's coordinate in each 
axis is essential for law enforcement personnel to be able to 
make tactical decisions. 

A. Area of Uncertainty 

We define the area of uncertainty (AoU) as the ratio of the 
L2 mean squared error (MSE) to the total area of the monitored 
room: 

AoU=„ L2MSE . (28) 



Room Area 



Table IV shows the percent reduction in the AoU by adding 
UWB data to AB-RTI and VB-RTI for S = 3 and S = 10 



TABLE IV 

Percent reduction of AoU by including UWB data. 



Study Room 
AB-RTI VB-RTI 



Motel Room 
AB-RTI VB-RTI 



S ■ 

S : 



: 3 

10 



40.2% 
61.8% 



32.4% 
43.2% 



26.3% 
41.3% 



0.2% 
14.9% 



The percent reduction in the AoU is significant except for 
VB-RTI in the motel room using 3 sensors. This may be due to 
the particular subsets of sensors used in the simulations when 
5 = 3. The reduction in the AoU confirms that by adding 
UWB data the system can more accurately indicate to law 
enforcement personnel in which part of the room the person 
is located. 

VIII. Conclusions 

In this work, we present a joint DFL system that uses the 
changes measured in RSS and UWB CIR to localize and 
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TABLE III 

Mean RMS localization error for the second experiment over all 18 trials for the methods described. Gating was used for all 

METHODS EXCEPT RANDOM SELECTION. UNITS GIVEN IN METERS. 





AB-RTI 


AB-RTI & HMM-UWB 


VB-RTI 


VB-RTI & VB-UWB 


Inversion with UWB 


X from Y 


Random 


L2 


2.10 


1.59 


2.07 


1.91 


1.84 


1.76 


3.31 


X 


1.54 


0.73 


1.61 


1.16 


1.31 


0.98 


1.53 


Y 


1.42 


1.41 


1.30 


1.51 


1.28 


1.44 


2.94 



track a person through walls. We target tactical operations and 
crisis situations where it is not possible for the police forces 
to place sensors on all sides of the area to be monitored. 
Experimental results show that including UWB with RSS 
data significantly improves localization accuracy when RSS 
sensors are only available on two sides of the target area. 
Where RSS sensors have been placed along the Y axis, 
improvements in accuracy along the X axis by including UWB 
data are especially significant. Without including UWB data, 
the accuracy along the X axis can be as bad as randomly 
guessing an X coordinate. 

We introduce three methods to combine the information 
from the UWB and RSS systems and we compare their 
performance. The multi-channel variance-based RTI method 
proposed in this work, which does not require an initial 
calibration in stationary conditions, is as effective or more 
effective than attenuation-based RTI for through-wall local- 
ization. The improvements in localization accuracy and the 
reduction in the AoU demonstrate that UWB data should be 
included in a DFL system for tactical operations where RSS 
sensors may only be placed on two sides of a room. 
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